69 research outputs found

    Linking named entities to Wikipedia

    Get PDF
    Natural language is fraught with problems of ambiguity, including name reference. A name in text can refer to multiple entities just as an entity can be known by different names. This thesis examines how a mention in text can be linked to an external knowledge base (KB), in our case, Wikipedia. The named entity linking (NEL) task requires systems to identify the KB entry, or Wikipedia article, that a mention refers to; or, if the KB does not contain the correct entry, return NIL. Entity linking systems can be complex and we present a framework for analysing their different components, which we use to analyse three seminal systems which are evaluated on a common dataset and we show the importance of precise search for linking. The Text Analysis Conference (TAC) is a major venue for NEL research. We report on our submissions to the entity linking shared task in 2010, 2011 and 2012. The information required to disambiguate entities is often found in the text, close to the mention. We explore apposition, a common way for authors to provide information about entities. We model syntactic and semantic restrictions with a joint model that achieves state-of-the-art apposition extraction performance. We generalise from apposition to examine local descriptions specified close to the mention. We add local description to our state-of-the-art linker by using patterns to extract the descriptions and matching against this restricted context. Not only does this make for a more precise match, we are also able to model failure to match. Local descriptions help disambiguate entities, further improving our state-of-the-art linker. The work in this thesis seeks to link textual entity mentions to knowledge bases. Linking is important for any task where external world knowledge is used and resolving ambiguity is fundamental to advancing research into these problems

    Frontotemporal dementia and its subtypes: a genome-wide association study

    Get PDF
    SummaryBackground Frontotemporal dementia (FTD) is a complex disorder characterised by a broad range of clinical manifestations, differential pathological signatures, and genetic variability. Mutations in three genes—MAPT, GRN, and C9orf72—have been associated with FTD. We sought to identify novel genetic risk loci associated with the disorder. Methods We did a two-stage genome-wide association study on clinical FTD, analysing samples from 3526 patients with {FTD} and 9402 healthy controls. To reduce genetic heterogeneity, all participants were of European ancestry. In the discovery phase (samples from 2154 patients with {FTD} and 4308 controls), we did separate association analyses for each {FTD} subtype (behavioural variant FTD, semantic dementia, progressive non-fluent aphasia, and {FTD} overlapping with motor neuron disease FTD-MND), followed by a meta-analysis of the entire dataset. We carried forward replication of the novel suggestive loci in an independent sample series (samples from 1372 patients and 5094 controls) and then did joint phase and brain expression and methylation quantitative trait loci analyses for the associated (p<5 × 10−8) single-nucleotide polymorphisms. Findings We identified novel associations exceeding the genome-wide significance threshold (p<5 × 10−8). Combined (joint) analyses of discovery and replication phases showed genome-wide significant association at 6p21.3, \{HLA\} locus (immune system), for rs9268877 (p=1·05 × 10−8; odds ratio=1·204 95% \{CI\} 1·11–1·30), rs9268856 (p=5·51 × 10−9; 0·809 0·76–0·86) and rs1980493 (p value=1·57 × 10−8, 0·775 0·69–0·86) in the entire cohort. We also identified a potential novel locus at 11q14, encompassing RAB38/CTSC (the transcripts of which are related to lysosomal biology), for the behavioural \{FTD\} subtype for which joint analyses showed suggestive association for rs302668 (p=2·44 × 10−7; 0·814 0·71–0·92). Analysis of expression and methylation quantitative trait loci data suggested that these loci might affect expression and methylation in cis. Interpretation Our findings suggest that immune system processes (link to 6p21.3) and possibly lysosomal and autophagy pathways (link to 11q14) are potentially involved in FTD. Our findings need to be replicated to better define the association of the newly identified loci with disease and to shed light on the pathomechanisms contributing to FTD. Funding The National Institute of Neurological Disorders and Stroke and National Institute on Aging, the Wellcome/MRC Centre on Parkinson's disease, Alzheimer's Research UK, and Texas Tech University Health Sciences Center

    Genome-wide analyses as part of the international FTLD-TDP whole-genome sequencing consortium reveals novel disease risk factors and increases support for immune dysfunction in FTLD

    Get PDF
    Frontotemporal lobar degeneration with neuronal inclusions of the TAR DNA-binding protein 43 (FTLD-TDP) represents the most common pathological subtype of FTLD. We established the international FTLD-TDP whole genome sequencing consortium to thoroughly characterize the known genetic causes of FTLD-TDP and identify novel genetic risk factors. Through the study of 1,131 unrelated Caucasian patients, we estimated that C9orf72 repeat expansions and GRN loss-of-function mutations account for 25.5% and 13.9% of FTLD-TDP patients, respectively. Mutations in TBK1 (1.5%) and other known FTLD genes (1.4%) were rare, and the disease in 57.7% of FTLD-TDP patients was unexplained by the known FTLD genes. To unravel the contribution of common genetic factors to the FTLD-TDP etiology in these patients, we conducted a two-stage association study comprising the analysis of whole-genome sequencing data from 517 FTLD-TDP patients and 838 controls, followed by targeted genotyping of the most associated genomic loci in 119 additional FTLD-TDP patients and 1653 controls. We identified three genome-wide significant FTLD-TDP risk loci: one new locus at chromosome 7q36 within the DPP6 gene led by rs118113626 (pvalue=4.82e-08, OR=2.12), and two known loci: UNC13A, led by rs1297319 (pvalue=1.27e-08, OR=1.50) and HLA-DQA2 led by rs17219281 (pvalue=3.22e-08, OR=1.98). While HLA represents a locus previously implicated in clinical FTLD and related neurodegenerative disorders, the association signal in our study is independent from previously reported associations. Through inspection of our whole genome sequence data for genes with an excess of rare loss-of-function variants in FTLD-TDP patients (n≥3) as compared to controls (n=0), we further discovered a possible role for genes functioning within the TBK1-related immune pathway (e.g. DHX58, TRIM21, IRF7) in the genetic etiology of FTLD-TDP. Together, our study based on the largest cohort of unrelated FTLD-TDP patients assembled to date provides a comprehensive view of the genetic landscape of FTLD-TDP, nominates novel FTLD-TDP risk loci, and strongly implicates the immune pathway in FTLD-TDP pathogenesis

    A C6orf10/LOC101929163 locus is associated with age of onset in C9orf72 carriers.

    Get PDF
    The G4C2-repeat expansion in C9orf72 is the most common known cause of amyotrophic lateral sclerosis and frontotemporal dementia. The high phenotypic heterogeneity of C9orf72 patients includes a wide range in age of onset, modifiers of which are largely unknown. Age of onset could be influenced by environmental and genetic factors both of which may trigger DNA methylation changes at CpG sites. We tested the hypothesis that age of onset in C9orf72 patients is associated with some common single nucleotide polymorphisms causing a gain or loss of CpG sites and thus resulting in DNA methylation alterations. Combined analyses of epigenetic and genetic data have the advantage of detecting functional variants with reduced likelihood of false negative results due to excessive correction for multiple testing in genome-wide association studies. First, we estimated the association between age of onset in C9orf72 patients (n = 46) and the DNA methylation levels at all 7603 CpG sites available on the 450 k BeadChip that are mapped to common single nucleotide polymorphisms. This was followed by a genetic association study of the discovery (n = 144) and replication (n = 187) C9orf72 cohorts. We found that age of onset was reproducibly associated with polymorphisms within a 124.7 kb linkage disequilibrium block tagged by top-significant variation, rs9357140, and containing two overlapping genes (LOC101929163 and C6orf10). A meta-analysis of all 331 C9orf72 carriers revealed that every A-allele of rs9357140 reduced hazard by 30% (P = 0.0002); and the median age of onset in AA-carriers was 6 years later than GG-carriers. In addition, we investigated a cohort of C9orf72 negative patients (n = 2634) affected by frontotemporal dementia and/or amyotrophic lateral sclerosis; and also found that the AA-genotype of rs9357140 was associated with a later age of onset (adjusted P = 0.007 for recessive model). Phenotype analyses detected significant association only in the largest subgroup of patients with frontotemporal dementia (n = 2142, adjusted P = 0.01 for recessive model). Gene expression studies of frontal cortex tissues from 25 autopsy cases affected by amyotrophic lateral sclerosis revealed that the G-allele of rs9357140 is associated with increased brain expression of LOC101929163 (a non-coding RNA) and HLA-DRB1 (involved in initiating immune responses), while the A-allele is associated with their reduced expression. Our findings suggest that carriers of the rs9357140 GG-genotype (linked to an earlier age of onset) might be more prone to be in a pro-inflammatory state (e.g. by microglia) than AA-carriers. Further, investigating the functional links within the C6orf10/LOC101929163/HLA-DRB1 pathway will be critical to better define age-dependent pathogenesis of frontotemporal dementia and amyotrophic lateral sclerosis

    The genetic architecture of the human cerebral cortex

    Get PDF
    The cerebral cortex underlies our complex cognitive capabilities, yet little is known about the specific genetic loci that influence human cortical structure. To identify genetic variants that affect cortical structure, we conducted a genome-wide association meta-analysis of brain magnetic resonance imaging data from 51,665 individuals. We analyzed the surface area and average thickness of the whole cortex and 34 regions with known functional specializations. We identified 199 significant loci and found significant enrichment for loci influencing total surface area within regulatory elements that are active during prenatal cortical development, supporting the radial unit hypothesis. Loci that affect regional surface area cluster near genes in Wnt signaling pathways, which influence progenitor expansion and areal identity. Variation in cortical structure is genetically correlated with cognitive function, Parkinson's disease, insomnia, depression, neuroticism, and attention deficit hyperactivity disorder

    A C6orf10/LOC101929163 locus is associated with age of onset in C9orf72 carriers

    Get PDF
    The G4C2-repeat expansion in C9orf72 is the most common known cause of amyotrophic lateral sclerosis and frontotemporal dementia. The high phenotypic heterogeneity of C9orf72 patients includes a wide range in age of onset, modifiers of which are largely unknown. Age of onset could be influenced by environmental and genetic factors both of which may trigger DNA methylation changes at CpG sites. We tested the hypothesis that age of onset in C9orf72 patients is associated with some common single nucleotide polymorphisms causing a gain or loss of CpG sites and thus resulting in DNA methylation alterations. Combined analyses of epigenetic and genetic data have the advantage of detecting functional variants with reduced likelihood of false negative results due to excessive correction for multiple testing in genome-wide association studies. First, we estimated the association between age of onset in C9orf72 patients (n = 46) and the DNA methylation levels at all 7603 CpG sites available on the 450 k BeadChip that are mapped to common single nucleotide polymorphisms. This was followed by a genetic association study of the discovery (n = 144) and replication (n = 187) C9orf72 cohorts. We found that age of onset was reproducibly associated with polymorphisms within a 124.7 kb linkage disequilibrium

    Brain volumetric deficits in MAPT mutation carriers: a multisite study

    Get PDF
    Objective: MAPT mutations typically cause behavioral variant frontotemporal dementia with or without parkinsonism. Previous studies have shown that symptomatic MAPT mutation carriers have frontotemporal atrophy, yet studies have shown mixed results as to whether presymptomatic carriers have low gray matter volumes. To elucidate whether presymptomatic carriers have lower structural brain volumes within regions atrophied during the symptomatic phase, we studied a large cohort of MAPT mutation carriers using a voxelwise approach. Methods: We studied 22 symptomatic carriers (age 54.7 ± 9.1, 13 female) and 43 presymptomatic carriers (age 39.2 ± 10.4, 21 female). Symptomatic carriers’ clinical syndromes included: behavioral variant frontotemporal dementia (18), an amnestic dementia syndrome (2), Parkinson’s disease (1), and mild cognitive impairment (1). We performed voxel-based morphometry on T1 images and assessed brain volumetrics by clinical subgroup, age, and mutation subtype. Results: Symptomatic carriers showed gray matter atrophy in bilateral frontotemporal cortex, insula, and striatum, and white matter atrophy in bilateral corpus callosum and uncinate fasciculus. Approximately 20% of presymptomatic carriers had low gray matter volumes in bilateral hippocampus, amygdala, and lateral temporal cortex. Within these regions, low gray matter volume

    Mendelian randomization implies no direct causal association between leukocyte telomere length and amyotrophic lateral sclerosis

    Get PDF
    Funder: QingLan Research Project of Jiangsu for Outstanding Young TeachersFunder: Project funded by Postdoctoral Science Foundation of Xuzhou Medical UniversityFunder: Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD) for Xuzhou Medical UniversityAbstract: We employed Mendelian randomization (MR) to evaluate the causal relationship between leukocyte telomere length (LTL) and amyotrophic lateral sclerosis (ALS) with summary statistics from genome-wide association studies (n = ~ 38,000 for LTL and ~ 81,000 for ALS in the European population; n = ~ 23,000 for LTL and ~ 4,100 for ALS in the Asian population). We further evaluated mediation roles of lipids in the pathway from LTL to ALS. The odds ratio per standard deviation decrease of LTL on ALS was 1.10 (95% CI 0.93–1.31, p = 0.274) in the European population and 0.75 (95% CI 0.53–1.07, p = 0.116) in the Asian population. This null association was also detected between LTL and frontotemporal dementia in the European population. However, we found that an indirect effect of LTL on ALS might be mediated by low density lipoprotein (LDL) or total cholesterol (TC) in the European population. These results were robust against extensive sensitivity analyses. Overall, our MR study did not support the direct causal association between LTL and the ALS risk in neither population, but provided suggestive evidence for the mediation role of LDL or TC on the influence of LTL and ALS in the European population
    • …
    corecore